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PROCESSING ACTIVITY MASKING IN A 

DATA PROCESSING SYSTEM 

This invention relates to the field of data processing systems. More 
5 particularly, this invention relates to the masking of processing activity within data 
processing systems, for example, in order to increase security. 

It is known to provide data processing systems which manipulate secure data 
and for which it is desirable to ensure a high degree of security. As an example, it is 
known to provide smart cards which include a data processing system which 
10 manipulates secure data, such as secret cryptographic keys, and this data must be kept 
secret in order to prevent fraud. 

Known ways of attacking the security of such systems include timing analysis 
and power analysis. By observing the timing behaviour and/or the power 
consimiption behaviour of such a system in response to inputs, information 
15 concerning the processing being performed and the data being manipulated can be 
determined in a way that can compromise security. It is strongly advantageous to 
provide resistance against such security attacks. 

Viewed from one aspect the present invention provides apparatus for 
processing data, said apparatus comprising: 
20 a resiilt data value generating circuit operable to gen^ate a restilt data value 

uponexecutionof a data processing operation; and . 

a data processing register to which said result data value is written; wherein 

at least one data processing operation executed by said result data generating 

circuit is a conditional write data processing operation for which a result data value is 

25 not written to said data processing register when non-write conditions are met; and 

» 

further comprising 

a trash register to which a restilt data value may be written upon execution of 
said conditional write data processing operation when said non-write conditions are 
met. 

30 This invention recognises that there is a characteristic power consximption 

signature associated with a write to a data processing register and accordingly 
irrformation concerning the data processing being performed in association with 
conditional write data processing operations can be externally observed, i.e. 
information upon whether or not the conditional write did or did not occur. The 
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invention address this problem by providing a trash register to which a result value 
(which is preferably the true value) is written when the conditional write data 
processing operation meets its non-write conditions and a write would not otherwise 
occur. Accordingly, a write to a register whether the true register or the trash register, 
always occurs irrespective of whether or not the write conditions or non-write 
conditions are met and thus the security of this system is enhanced. 

The data register to which the write is normally made when the write 
conditions are met is preferably part of a register bank containing a plurality of such 
registers. I& this circumstance, a conmon trash register(s) may be used for dummy 
writes irrespective of how many real data registers are provided within the register 
bank. 

Preferably, the trash register is physically located as part of the register bank 
so as to avoid leakage of information by observing which part of a circuit is active at 
any given time. 

15 Whilst the present technique is applicable to a wide variety of systems, such a 

microprocessor based systems, digital signal processing system and the like, the 
invention is particularly well suited to systems including a processor core responsive 
to data processing instructions as these often incorporate conditional write instructions 
which give rise to the potential vulnerability addressed by this technique. 

20 It will be appreciated that a conditional write operation may be arranged to 

either occur or not occur when particular conditions are met. 

It will be appreciated tiiat the normal technical prejudice in this field is to 
reduce power consumption as much as possible. Accordingly, it would 
conventionally be considered that not performing a register write when a conditional 

25 write operation did not require one would be an advantageous feature since it would 
reduce the amount of power consumed. The present technique moves against this 
technical prejudice in the field by deliberately performing a trash register write and 
consuming power even though this is. not required for the real processing activities of 
the system. 

30 In preferred embodiments of the invention the trash register activity can be 

selectively enabled and disabled depending upon a control signal stored in a system 
configuration register. This allows programmable activity of the trash register activity 
such that power can be saved by disabling this feature when non secure processing is 
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taking place and yet security improved when required, such as v^eh handling 
cryptographic keys, decoding passwords etc. 

As mentioned above, whilst the trash register may be physicaUy located within 
the register bank with the normal data registers, in preferred embodunents the trash 
register is unmapped to a register number such that it cannot be specified by any 
program mstruction and accordingly is invisible as a register from the programmer's 
model point of view. The trash register is however visible in the sense that its activity 
can be enable or disabled in preferred embodiments by a configuration parameter. 

Viewed from another aspect the present invention provides a method of 
processing data, said method comprising the steps of: 

generating a result data value upon execution of a data processing operation, at 
least one data processing operation executed being a conditional write data processing 
operation, wherein 

a result data value is not written to a data processing register when non-write 
conditions are met but is instead written to a trash register. 

Embodiments of the invention will now be described, by way of example 
only, with reference to the accompanying drawings in which: * 

Figure 1 schematically illustrates a data processing system operable in a fixed 
timing mode and a variable timing mode; 

Figure 2 schematically illustrates a conditional programming instruction; 

Figure 3 is a flow diagram schematically illustrating part of the processing 
operations performed by an instmction decoder operating in accordance with the 
present techniques; 

Figure 4 schematically illustrates the execution of a conditional branch 

instruction in a fixed timing mode; 

Figure 5 is a diagram schematically illustrating a data processing system 
including multiple circuit portions which may be selectively enabled to perform 
required processing operations or dummy processing operations; 

Figure 6 schematically illustrates a circuit portion and its associated dummy 
activity enabling circuit which may be responsive to both required enable signals and 
. random dununy activity enable signals; 

Figure 7 schematicaUy illustrates a linear shift back feed register which may 
be used as a pseudo-random signal generator: 



DYC Ref. P016007GB 
» ARM Ref: P268 

Figure 8 is a flow diagram schematically illustrating control of a circuit 
portion to perform required processing activity and dummy processing activity; 

Figure 9 schematically illustrates a portion of a register bank including 
multiple data processing registers, multiple dummy registers, multiple shared dummy 
5 registers and a non-mapped trash register RT to which a dummy register write is made 
when a conditional write operation fails its condition codes; 

Figure 10 is a flow diagram schematically illustrating a register write 
controlling circuit which seeks to balance the number of high to low and low to high 
transitions occiirring when a register write occurs; 
io Figure 1 1 is a table illustrating the relationships between bit transitions for a 

particular bit within the data register and three further registers which are configured 
to balance the high to low and low to high transitions occiuring in association with a 
register writet; 

Figure 12 is a flow diagram schematically illustrating control of writing to a 
15 trash register on a condition code fail of a write operation; 

Figure 13 is a diagrana schematically illustrating a system having multiple 
execution mechanisms for an instruction and pseudo random selection of the 
execution mechanism employed for at least some instructions; and 

Figure 14 is a flow diagram schematically illustrating control of the system of 
20 Figure 13. 

Figure 1 illxistrates a data processing system 2 including a processor core 4, a 
coprocessor 6 and a memory 8. 

In operation, the processor core 4 fetches instructions and data from the 
memory 8. The instractions are fed to an instmction pipeline 10 where they occupy 
25 successive pipeline stages such as, for example, fetch, decode, execute, memory and 
write back on successive processing cycles. Pipelined processors are in themselves 
well known as a way of effectively executing a number of program instructions in a 
partially overlapping fashion in order to improve processor performance. 

30 The data values read from the memory 8 by tiie processor core 4 are supplied 

to a register bank 12 from where they may be manipulated imder program instruction 
control using one or more of a multiplier 14, a shifter 16 and an adder 18. Other data 
manipulating circuits may be provided, such as circuits performing logical operations, 
such as ANDs, Ors, count leading zeros etc. 
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■ Figure 1 also iUustrates an instruction decoder 20 within the processor core 4 
which is responsive to a program instruction within the instruction pipeline 10 to 
generate execution control signals that are appUed to the various processing elements, 
such as the register bank 12. the multiplier 14. the shifter 16 and the adder 18 m order 
to control the data processing operations perfonned. As an example, the control 
signals generated by the decoder 20 may cause the appropriate operands to be read 
from the register bank 12 and suppUed ^d acted upon by the appropriate ones of the 
multiplier 14, Ihe shifter 16 and the adder 18 so as to generate axesult which is written 

back mto the register bank 12. 

The coprocessor 6 is a system configuration coprocessor containing a number 
of configuration registers 22 which may be written under program control to set up 
configuration controlling parameters. TTiese configuration controlling parameters can 
specify many aspects of the configuration of the processing system 2, such as for 
example the endianess and the like. Included within one of these configuration 
controlling registers 22 is a bit which specifies whether or not the processor core 
should operate in a fixed timing mode or a variable timing mode. This bit is 
Ulustrated as being suppUed as an input to the mstruction decoder 20, but it will be 
appreciated that this bit may also be suppUed to various other points within the 
processor core 4 as required to control their behaviour. In dependence upon this 
fixed/variable bit, the processor core 4 operates in eiliier a fixed timing mode or a 
variable timing mode. When in the fixed timing mode at least one program 
instruction which has a variable timing (i.e. takes a variable number of processing 
cycles to complete) in the variable timing mode, is instead forced to have a fixed 
timing (e.g. take liie maximum possible number of processing cycles to con^lete 
irrespective of whether or not it could have been suppressed in its entirety or 
completed in less than the maximum number of processing cycles. As the instruction 
decoder 20 is primarily responsible for decoding the program instructions and 
instmcting the activity of the other elements of the processor core 4, the instruction 
decoder 20 can take the major role in controUing the processor core 4 to either operate 
in the fixed timing mode or the variable timing mode. Not all variable timing 
instmction need be provided with a fixed timing mode type of operation. 

It will be appreciated that in Ihe above description a single bit in the 
configuration controlling register 22 is shown as switching between fixed and variable 
timing modes. Alternatively, multiple bits within the configuration contirolling register 
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22 may be provided to separately enable and disable the fixed or variable timing 
behaviour of different types of instruction, such as conditional instruction behaviour, 
uniform branch behaviour, disabling early terminate, etc. 

Figure 2 schematically illustrates a conditional instruction 24. This 
. 5 conditional instruction may be part of an instmction set which includes only some 
conditional instmctions or part of an instruction set, such as the ARM instraction set, 
which is substantially fully conditional. The condition codes 26 encode a set of 
processor state conditions in which the associated instruction either will or will not be 
executed. As an example, the condition codes 26 can be arranged to specify that the 
10 instmction 24 will not execute if the condition codes currently set in tiie system 
indicate a zero result, a carry has occurred, an overflow has occurred or the like. This 
type of instruction can be utilised to provide efficient program coding. The 
fixed/variable bit at least partially suppresses the conditional behaviour in that the 
instruction will execute irrespective of its condition codes, but may not write its result 
15 in a way that has an efifect upon the processor state. 

Figure 3 is a flow diagram schematically illustrating part of the processing 
operations performed by the instruction decoder 20. It will be appreciated that Figure 
3 illustrates these processing operations as a logical sequence, whereas in practice 
these processing operations may be performed at least partially in parallel or in a 
20 different order. 

At step 28, the instruction decoder 20 waits for a new instruction to execute. 
When a new instruction is received processing proceeds to step 30 at which the 
condition codes associated with tiie new instruction are read. At step 32 these 
condition codes are compared with the currentiy existing condition codes in the 
25 • system. These condition codes currentiy existing in the system are the result of 
previous processing activity, either in the immediately preceding instraction or in the 
last instruction which would have updated those condition codes. 

At step 34, a check is made for a match between the condition codes 26 of the 
current instruction being executed and the existing condition codes. If a match does 
30 not occur, then processing proceeds to step 36 where execution of the current 
instruction is started. It will be appreciated that Figure 3 illustrates a system in which 
execution occurs when a match does not occur, but altemative embodiments could 
equally well be ones in which execution occurs when a match occurs. 
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Following step 36, processing proceeds to step 38 whei:e a check is made as to 
whether or not early termination of the instraction is possible. This early termination 
may, for example, be because one of the operands has a particular value, such as zero 
or unity, or on subsequent processing cycles that a particular partial result has been 
produced. If early temiination is possible, then processing proceeds to step 40 where 
a check is made as to whether or not the processor core 4 is currently operatmg in the 
fixed or variable timing mode. If the processor is in the variable timing mode, then 
processing proceeds to stfep 42 and the instruction concemed is early terminated with 
the result being retiined as appropriate and processing returns to step 28. 

If the determination at step 40 is that the system is in the jSxed timing mode, 
then processing proceeds to step 44 irrespective of Ihe fact that early temiination is 
possible. Step 44, which may also be reached by a determination at step 38 that early 
termination is not possible, executes the instruction concemed for one processing 
cycle. In the case of a multicycle processing instruction, such as a multiplication, a 
divide, an add or a subtraction, these typically take several cycles to execute and so 
after step 44 processing proceeds to step 46 at which a determination is made as to 
whether or not the maximum number of cycles associated with that instruction has yet 
been performed. If the maximum number of cycles has been performed, then the 
result will have been generated. If early termination was possible and the system was 
being forced to continue to execute for further processing cycles, then step 46 will still 
indicate that this forced execution should cease when the maximimi possible number 
of processing cycles for that type of instruction has been reached. If the maximum 
number of processing cycles has not yet been performed, then processing is retumed 
to step 38. 

If the match tested for at step 34 was positive, then processing proceeds to step 
48. In this example, the positive detection of a match at step 34 indicates ttiat 
execution of the particular instruction should be suppressed. Step 48 determines 
whether or not the system is currentiy in the forced execution mode. If in the forced 
execution mode, then processing proceeds to step 50 where a forced dummy 
execution of the instruction will occur. When dummy execution is perforaied the 
result is written to a trash register (see trash register 51 in Figure 1), rather than the 
destination specified in the instruction itself so as to prevent the state of the system 
being modified by a program instruction which should not have executed as it should 
have been suppressed whilst also keeping a substantially unaltered power 
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consumption. If at step 48 the determination is tiiat the system is not in the forced 
execution mode but is in the variable timing mode, then processing bypasses step 50 
and returns to step 28 with the program instruction being suppressed in the normal 
way. 

It will be appreciated that Figure 3 illustrates a generic system in which 
. dummy execution is applied to all condition code failed instructions and all early 
termination of instructions is suppressed. In practice, it is also possible for these 
techniques to be applied to a subset of conditional instructions and instructions 
capable of early termination. The multiple configuration controlling bits mentioned 
above could be used to selectively tum on features such as early terminate 
suppression, but not others, such as dummy execution following a condition code fail. 

Figure 4 schematically illustrates the execution of a conditional branch 
instruction in the fixed timing mode. A sequence of instractions AB are executed 
until a conditional branch instruction BEQ (branch upon equal) is reached. This 
instruction encodes the behaviotu: that the specified branch will be performed if the 
flag indicating an equal res^t fi-om previous processing is set and will be suppressed 
if this flag is not set. When the condition codes are passed, i.e. a condition code 
match, then the branch is taken and processing proceeds to instructions X, Y, etc. If 
the condition codes fail, then instead of being suppressed in its entirety, the BEQ 
instruction performs a branch to the inmiediately following instruction C. This is the 
same instruction which would have been reached if the BEQ instraction had been 
suppressed and not executed at all. However, in the fixed timing mode, the BEQ will 
have executed consuming the same number of processing cycles irrespective of 
whether or not the condition codes were passed or failed. This helps obscure the 
results of data processing operations previously performed jfrom a person trying to 
gain access to secure data. 

Figure 5 schematically illustrates* a data processing system 52 in the form of a 
programmable processor core which responds to program instmctions I and ' 
manipulates data D. The data processing system 52 includes a register bank 54, a 
multiplier 56, a shifter 58, an adder 60, and arithmetic logic unit 62, a load store unit 
64, a data pipeline 66, and instmction decoder 68 and a random clock gating circuit 
70. A system configuration coprocessor CP 15 72 is coupled to the processor core. 
The system configuration coprocessor 72 includes a system configuration register 74 
holding multiple flag values which respectively serve to enable or disable the pseudo 
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random dummy activity of dijSerent circuit portions of the data processing system 52. 
It wiU be appreciated that the data processing system 52 wiU typically include many 
more circuit elements, but these have been omitted for the sake of clarity from Figure 

5. " . 

5 Associated with the multipUer 56 is a dummy activity enable circuit 76 which 

serves to enable dummy activity in the multipUer 56 as appropriate or alternatively 
pass the required activity enable signals to activate the multipUer 56 when the 
program instruction being executed so requires. Sunilar dummy activity enables 
circuits 78, 80, 82, 84 are associated with the respective other circuit portions 58, 60, 
10 62, 64 previously mentioned. 

In operation, instiiictions to be executed are to be passed to tiie instmction 
pipeline 66 and on to tiie instmction decoder 68 to generate instiruction driven enable 
signals which are applied to respective circuit portions, these enable signal serve to 
select the data patii tiirough tiie data processing system 52 and to activate the ckcuit 
15 portions concerned to read tiieir inputs, to perform the specified processing and to 
generate their associated output signals. As an example, a multipUer-accumulate 
operation might read data values from tiie register bank 54, apply tiiese to the 
multiplier 56 and adder 60 and tiien write back tiie result to tiie register bank 54. Thus, 
tiie register bank 54, tiie multipUer 56 and tiie adder 60 would all be subject to 
20 required activity enable signals which botii enabled tiieir operation and selected tiiem 
to form a complete data patii. The difEerent circuit portions have different power 
consumption characteristics and timing characteristics such tiiat external observation 
could reveal which instruction were being executed by observing such parameters. 
Accordingly, pseudo random dummy activity of tiie otiier ckcuit portions not requked 
25 for tiie instruction being executed is also enabled. Thus, even tiiough tiie shifter 58 
may not being used by tiie particular multipUer accum^ate instruction being 
executed, it may nevertheless be pseudo randomly enabled such tiiat it wiU consume 
power by shifting whatever value is appUed to its input. Its output latches will not be 
enabled to avoid this dummy activity altering tiie circuit state an undesired way which 
30 may interfere witii required operation e.g some circuit portions might assume 

persistence of output values. The dummy activity is enabled for time periods matching 
tiie normal operation timings for tiie circuit portions concerned. 

The random clock gating circuit 70 serves to receive a pluraUty of pseudo 
random enable signals for different respective circuit portions and gate tiiese for 




X- - DYC Ref: P016007GB 
ARMRef:P268 

applying to respective circuit portions under the control of configuration parameters 
read from the system configuration register 74 within the system configuration 
coprocessor 72. These configuration flags may indicate that dummy activity should 
be enabled for the shifter 58, ALU 62 and multiplier 56, but not for the adder 60 or 
5. the load store unit 64. The different pseudo random enable signals allow different 
pseudo random characteristics to be applied in a manner that can match these 
respective circuit portion concemed. As an example, there may a different minimum 
enable time associated with the normal timing of the different circuit portions. 

At an overall level, it will be seen that the instraction decoder 68 will serve as 

1 0 a required activity enabling circuit which will enable the circuit portions required to 
perform the data processing operation specified by the instraction currentiy being 
executed, superimposed upon this required activity, various dummy activities within 
the other circuit portions will be enabled/stimulated by the dummy activity control 
circuitry provided in various places in the rest of the data processing system 52. The 

15 dummy activity serves to mask the power consumption and timing characteristics 
associated with the required activity. 

Figure 6 schematically illustrates a circuit portion 86 which may be subject to 
both a required enable signal en and a dummy enable signal md. This circuit portion 
86 can be considered as a sequence of latches between which processing logic 

20 manipulates data values. When genuine reqxiired activity is necessary, all of the 
latches that provide a data path through the circuit portion 86 are enabled and the 
reqxiired processing will be performed between the input latches and the output 
latches. When dummy activity is instracted, then only the input latches and the 
intermediate latches are enabled. Thus, a data path is not provided through the fiill 

25 circuit portion and the output values generated by that circuit portion are not altered. 

Figure 7 illustrates a linear feedback shift register of the type which may be 
used to generate pseudo random clock signal. These clock signals can be provided to 
the random clock dating circuit 72 of Figure 5. Separate pseudo random signal 
generators may be provided for the different circuit portions. The fixed clock 

30 firequency associated with the different pseudo random generators may be altered so 
as to match the characteristics of the circuit portion concemed and fiarther obscure the 
masking operation as required. 

Figure 8 schematically illustrates the control of an enable signal for a circuit 
portion. At step 88 a determination is made as to whether or not an enable signal en 
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has been received jfrom the instruction decoder 68. If such an enable signal has been 
received, then processing passes to step 90. An enable signal from the instruction 
decoder 68 indicates that required processing operation is necessary in accordance 
with a genuine program instruction being decoded. Thus, step 90 enables the input, 
output and clock signal to the circuit portion concerned. If at step 88 no enable signal 
en is received from the instruction decoder, then processing passes to step 92 where a 
detenniaation is made as to whether or not dummy operation of that circuit portion is 
permitted. If dummy operation is permitted, then processing proceeds to step 94 at 
which the inputs and clock to the circuit portion are enabled, but the outputs from the 
circuit portion are not enabled. The circuit portion then undertakes dummy activity. 
If tiie determination at step 92 was that dummy operation was not permitted, as 
indicated by the system configuration parameter(s), then processing terminates by 
passing to step 94. 

It will be appreciated 1hat the process illustrated in Figure 8 is in the form of a 
sequential flow diagram. In practice, this control may be performed in a different 
sequence and use circuit elements spread throughout the data processing system 52, 
The operations illustrated as being sequentially performed may in fact be performed 
in parallel or the control functions modified. At an overall level an individual circuit 
portion will be enabled to perform its normal required operation in response to an 
appropriate program instmction and will be enabled to perform dvramay activity when 
permitted by the associated configuration parameter, x 

Figure 9 schematically illustrates a register bank 96. This register bank is 
based upon the ARM processor programmer's model for user mode operation in 
accordance with processors designed by ARM Limited, Cambridge, England). In 
practice, further registers may be provided for other processor modes, but these have 
been omitted for the sake of clarity. The normal data registers RO to R15 are provided 
for holding data values. The registers R13, R14 and R15 typically serve to store tihte 
program counter value, the branch return address value and the stack pointer, which 
tend to be none seciirity related data values. Accordingly, transition balancing upon 
data writes is not necessary for R13, R14 and R15. A trash register RT is provided 
within the register bank 96 for use in association with conditional writes which fail 
their condition codes thus, a conditional write instmction which fails its condition 
code would not normally make any write. However, with this system such a failed 
conditional write instruction nevertheless writes a data value to the trash data register 
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RT even though the condition codes have failed. This masks any diSerence in power 
consumption or timing that might be associated with condition code failure or 
condition code passing of a conditional write operation. The trash data register RT 
does not appear in the programmer's model in a way that enables it to be addressed 
with a register specifying operand within an instruction. 

As well as the trash data register RT, further registers 98, 100 are also 
provided for the purpose of balancing the high to low and low. to high transitigns. 
Dedicated dummy registers 98 are provided in respect of the data registers RO to R12 
as well as the trash data register RT. Shared dummy registers 1 00 are provided for 
storing an exclusive OR value as well as the inverse of the exclusive* OR value in 
response to each write to a data register subject to the transition balancing technique. 
A register write control circuit 102 serves to generate the appropriate data values to be 
written to the fiirther registers 98, 1 00 in response to a data value write to a data 
register. This symmetric write control is selectively enabled and disabled by an 
•appropriate system configuring controlling flag signal from the system configuring • 
coprocessor 72. 

Figure 10 is a flow diagram schematically illustrating the operation of the 
register write control circuit 102. At step 104 the circuit waits for a register write 
operation to be instructed. Step 106 determines whether this register write is to one of 
the data registers or the trash data registers RT for which the symmetric write control 
system is applied. If the register write is not to such a register, then processing 
proceeds to step 108 and a simple write of the required data value X is made to one of 
the registers R13, R14 and R15. 

If the register to which the write is being made is potentially subject to 
symmetric register writing then step 110 serves to determine whether or not this 
feature is currently enabled. If this feature is not currently enabled, then processing 
proceeds to step 108.. If this feature is enabled, then processing proceeds to step 1 12. 

At step 112, the register controls circuit calculates for each bit position within 
the data value a value being the inverse exclusive OR of the current bit being written 
at that position and the previously stored bit at that position which is then exclusive 
Ored with the previously stored dummy register value for that bit position (see Figure 
1 1). Tlie register control circuit 102 also calculates the inverse of the determination as 
well the inverse of the bit being written as the data value to the data register. These 
values are calculated for all of the bits being written (e.g. 3 dummy 32-bit values). 
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At step 1 14 the data value is written to the data register in a similar manner to 
step 108. At step 1 1 6 the three further values determined for each bit position within 
the registers concerned are written to the three further registers. Steps 1 14 and 1 16 
take place simultaneously. As will be discussed in relation to Figure 1 1, this results in 
a balance number of high to low and low to high transitions and thus power 
consumed. 

Figure 1 1 illustrates a table of possible bit values before and after a data write 
operation. The data value is being written to a register Rn, vmich is a register to 
which the symmetric write operation fiinctioh is appUed. The values at time t and 
time t+1 are iUustrated. The inverse of these values is simply determined. Each of 
the data registers subject to this symmetric operation is provided with a dedicated 
dummy register 98 which stores tiie inverse of the data value curr^itiy held in the data 
register. 

The shared dummy registers 100 are indicated in Figure 11 as registers Rd. 
For each bit position on within the shared dummy register Rd the new value to be 
written at that bit position when a data write occurs is determined by tiie function 
shown at the bottom of Figure 1 1 . This function ensures that when a change does not 
occur in the data value and the inverse of the data value, then a change is guaranteed 
to occur in the corresponding bit within the shared dummy register and accordingly its 
inverse. The table shows the changes in the shared dummy register values which 
occur when the data value does not change and the shared values in the dummy 
register values not changing when the data value does change. Thus, there is a 
guaranteed fixed number of transitions for every write, i.e. a balanced equal number 
of transitions high to low and low to high. 

Figure 13 is a flow diagram illustrating the action of the dummy data register • 
RB to provide writes when a write operation fails its condition code(s). At step 1 18 
tiie contirol logic waits for an instruction to be received. This control logic may be the 
insti^ction decoder 68 or otiier logic. Step 120 determines whetiier or not tiie 
instruction failed its condition codes. If tiie instinction does not fail its condition 
code, tiien it is normally executed at step 122 and makes its write to tiie register 
specified by tiie register operand witiim tiiat instnction. If tiie insti^ction does fail its 
condition codes, tiien processing proceeds to stej) 124 at which a determination is 
made as to whetiier or not dummy data register writes are enabled. If these are not 
enabled, then processing termmates. If dummy data register writes are enabled, tiien 
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processing proceeds to step 126 at which a write of the data value calculated by the 
condition code feiled instruction is written to the trash data register RT even though 
the condition codes failed. This balances the power consumption and timing 
irrespective of a condition code pass or a condition code fail. It will be appreciated 
that the trash data register RT is also subject to the transition balancing mechanisms 
previously discussed. 

Figure 13 illustrates a data processing system 128 in which multiple 
instruction execution mechanisms are provided for at least some instructions. The 
data processing system 128 is one which supports the native execution of at least 
some Java bytecode instructions. This type of data processing system and native 
execution is described in published PCT PatMt Application Number WO-A- 
02/29555. The disclosure of this published application as a whole and in respect of 
the native hardware execution and selective software emulation of more complex Java 
bytecodes in particular is incorporated here in by reference. 

The Java bytecode decoder 130 may be selectively enabled and disabled by an 
input signal. When the Java bytecode decoder 130 is disabled a received Java 
bytecode will trigger an exception which starts execution of software emulation code 
for handling Java bytecodes using the native ARM Thumb instruction sets. This 
support code is stored within memory in area 132 as illustrated. The Java bytecode 
program 134 is also stored within memory. When it is desired to obscure the nature 
of the Java program execution, the Java bytecode decoder 130 may be subject to a 
pseudo random signal which selectively enables and disables this element so as to 
effectively switch the instruction execution mechanism for the Java bytecodes 
between a mixed hardware and emulation execution mechanism and a purely 
emulation mechanism. Configuration controlling values within a system 
configuration register 136 specify whether or not the Java decoder 130 is present and 
whether or not random enabling and disablmg of this Java decode 130 is pemiitted. 

Figure 14 schematically illustrates the handling of a received Java bytecode. 
At step 138 a Java b3rtecode is received. Step 140 determines whether or not the Java 
decoder 130 is enabled. The pseudo random enabling and disabling of the Java 
decoder 130 effectively causes a branch to either step 142 at which the bytecode is 
always emulated or an attempt to execute the instruction in hardware at step 146. 
This obscures/masks the power signature associated with Java bytecode execution. If 
the determination at step 146 is that the particular Java bytecode concerned is not 
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supported by the Java decoder 130, then this Java bytecode will also be emulated in 
software at step 142. HoweveTi if the Java bytecode is supported in hardware, then it 
is executed in hardware at step 146. 
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CLAIMS 

5 1 , Apparatus for processing data, said apparatus comprising: 

a result data value generating circuit operable to generate a result data value 
upon execution of a data processing operation; and 

a data processing register to which said result data value is written; wherein 
at least one data processing operation executed by said result data generating 
10 circuit is a conditional write data processing operation for which a result data value is 
not written to said data processing register when non-write conditions are met; and 
fluther comprising 

a trash register to which a result data value may be written upon execution of 
said conditional write data processing operation when said non-write conditions are 
15 met. 

2. Apparatus as claimed in claim 1, comprising a register bank having a plurality 
of data registers to which result data values are written. 

20 3. Apparatus as claimed in any one of claims 1 and 2, wherein said result data 
generating circuit is part of a processor core responsive to data processing instructions 
specifying data processing operations to be performed. 

4. Apparatus as claimed m claim 3, v^^erein said data processing instructions 
25 include one or more conditional data processing instructions and said processor core 

does not executed conditional data processing instructions when predetermined 
conditions are detected. 

5. Apparatus as claimed in claim 4, wherein if a conditional data processing 

30 instruction includes a write operation and said predetermined conditions are detected, 
then said processor core performs a write operation to said trash register. 

6. Apparatus as claimed in any one of the preceding claims, wherein writing to 
said trash register is programmably disabled by a trash register control signal. 
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7. " Apparatus as claimed in claim 6, wherein said trash register control signal is 
stored in a system configuration register. 

8. Apparatus as claimed in clahn 2, wherein said trash register is part of said 
register bank, said trash register being unmapped to a register number such that said 
trash register may not be specified by a register specifying operand value. 

9. A method of processing data, said method comprising the steps of: 
generating a result data value upon execution of a data processing operation, at 

least one data processing operation executed being a conditional write data processing 
operation, wherein 

• a result data value is not written lo a data processing register when non-write 
conditions are met but is instead written to a trash register. 

10. A method as claimed in claim 9, wherein said data processing register is part 
of a register bank having a plurality of data registers to which result data values are 
written. 

11. .A method as claimed in any one of claims 9 and 10, wherein said method is 
performed by of a processor core responsive to data processing instructions specifying 
data processing operations to be performed. 

12. A method as claimed in claim 1 1, wherein said data processing instmctions 
include one or more conditional data processing instructions and said processor core 
does not executed conditional data processing instructions when predetermined 
conditions ^e detected. 

13. A method as claimed in claim 12, wherein if a conditional data processing 
instruction includes a write operation and said predetermined conditions are detected, 
then said processor core performs a write operation to said trash register. 

14. A method as claimed in any one of claims 9 to 13, wherein writing to said 
trash register is programmable disabled by a trash register control signal. 

17 
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15, A method as claimed in claim 14, wherein said trash register control signal is 
stored in a system configuration register. 

16. A method as claimed in claim 10, wherein said dummy register is part of said 
register bank, said trash register being unmapped to a register number such that said 
trash register may not be specified by a register specifying operand value. 
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ABSTRACT 

PttOnrcSSING ACnVITY MA SiQNG IN A 
DATA PROrFSSTNG SYSTEM 

Within a data processing systems supporting conditional write processing 
operations, a trash register is provided such that when non-write conditions are 
encountered a register write is made to the trash register rather than the data register 
specified by the conditional write operation. Thus the power signature associated 
with whether or not a register write does or does not occur is masked. The trash 
register activity may be programmable enabled and disabled by a configuration 
parameter stored within a system configuration register. 

Figure 12 
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